Measuring thermodynamic length 
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Thermodynamic length is a metric distance between equilibrium thermodynamic states. Among 
other interesting properties, this metric asymptotically bounds the dissipation induced by a finite 
time transformation of a thermodynamic system. It is also connected to the Jensen-Shannon diver- 
gence, Fisher information and Rao's entropy differential metric. Therefore, thermodynamic length 
is of central interest in understanding matter out-of-equilibrium. In this paper, we will consider how 
to define thermodynamic length for a small system described by equilibrium statistical mechanics 
and how to measure thermodynamic length within a computer simulation. Surprisingly, Bennett's 
classic acceptance ratio method for measuring free energy differences also measures thermodynamic 
length. 

PACS numbers: 05.70.Ln, 05.40.-a 



INTRODUCTION 

Thermodynamic length is a natural measure of the dis- 
tance between equilibrium thermodynamic states P, [3, S 
SSIESSSEyjEI, which equips the surface of thermo- 
dynamic states with a Riemannian metric and defines the 
length of a quasi-static transformation as the number of 
natural fluctuations along that path. Unlike the entropy 
or free energy change, which are state functions, the ther- 
modynamic length explicitly depends on the path taken 
through thermodynamic state space. Thermodynamic 
length is of fundamental interest to the generalization of 
thermodynamics to finite time (rather than infinity slow) 
transformations. Minimum distance paths are geodesies 
on the Riemannian manifold and minimize the dissipa- 
tion for slow, but finite time transformations These 
insights have been employed to optimize fractional distil- 
lation and other thermodynamic processes 12, 13, 14]. 

The study of thermodynamic length has largely been 
restricted to the field of macroscopic, endoreversible ther- 
modynamics. However, there are deep connections be- 
tween thermodynamic length, information theory and the 
statistical physics of small systems far-from-equilibium. 
In this paper we will consider the most appropriate def- 
inition of thermodynamic length for small systems and 
how to measure this distance in a computer simula- 
tion. These considerations reveal a surprising connection 
between thermodynamic length, Jensen-Shannon diver- 
gence and Bennett's acceptance ratio method for free en- 
ergy calculations [l^. Bennett's method is an optimal 
measure of free energy differences, but it also indirectly 
places a lower bound on the thermodynamic length be- 
tween neighboring thermodynamic states. 



THERMODYNAMIC LENGTH 

Consider a physical system, possible microscopically 
small, in equilibrium with a large thermal reservoir. The 



configurational probability distribution is given by the 
Gibbs ensemble, [lB| 
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where x is the configuration, t is time, (3 — l/k^T is the 
reciprocal temperature (T) of the environment in natu- 
ral units, (/cb is the Boltzmann constant), Z is the par- 
tition function, and H is the Hamiltonian of the sys- 
tem. This total Hamiltonian is split into a collection of 
collective variables Xi and conjugate generalized forces 
A*, (3H = X^{t)Xi{x). We use the Einstein convention 
that repeated upper/lower indices are implicitly summed. 
The sub-Hamiltonians X are time-independent functions 
of the configurations, whereas the conjugate variables A 
are time dependent and configuration independent. Note 
that the conjugate variables include a factor of inverse 
temperature. 

The A's are the experimentally controllable parameters 
of the system and define the accessible thermodynamic 
state space. For example, in the isothermal-isobaric en- 
semble we have X = {U, V} and A = {/?, where U is 
the internal energy, V is the volume and p is the external 
pressure. Modern experimental techniques have broad- 
ened the range of controllable parameters beyond those 
considered in standard thermodynamics. For instance, 
optical tweezers can apply a constant force to the ends of 
a single DNA molecule. The equilibrium description of 
this system includes the extension of the polymer, with 
the tension as conjugate variable. In computer simula- 
tions we have much greater flexibility. The configuration 
functions can be rather arbitrary collective variables de- 
lineating high dimensional manifolds of equilibrium ther- 
modynamic states. 

The partition function that normalizes the probability 
distribution, Z, is directly related to the free energy F 
(Gibbs potential), the free entropy ip (Massieu potential) 
and entropy S: 
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Angled brackets indicate an average over the appropriate 
equilibrium ensemble. The first derivatives of the free 
entropy give the first moments of the collective variables, 



(3) 



and the second derivative yields the covariance matrix, 

= = - (^.»(^. - (^.») • 

(4) 

The covariance matrix gij is positive semi-definite and 
varies smoothly from point to point, except at macro- 
scopic phase transitions. Therefore, we can use the co- 
variance matrix as a metric tensor and naturally equip 
the manifold of thermodynamic states with a Rieman- 
nian metric. Recall that a metric provides a mea- 
sure of 'distance' between points. It is a real func- 
tion d{a,b) such that (1) distances are non-negative, 
d(a, b) > with equality if and only if a = &, (2) sym- 
metric, d{a, b) — d{b, a) and (3) it is generally shorter 
to go directly from point a to c than to go by way of 
b, d{a,b) + d{b,c) > d{a,c) (The triangle inequality). 
Moreover, in a Riemannian metric we can measure the 
distance along curves connecting different points. The 
length of a curve parameterized by t, from to r, is 
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and the point-to-point distance is the length of the short- 
est curve. Curves of locally minimal distance are called 
geodesies, and are the closest analogs of straight lines 
in a curved space. Because of the connection to fluctu- 
ations [Eq. ([U] the length of curves in thermodynamic 
state space are measured by the number of natural fluc- 
tuations along the path. The larger the fluctuations the 
closer points are togetherJlTl. [l8| . 

Originally, Weinhold [l|] defined the thermodynamic 
length C using the second derivatives of the internal en- 
ergy U{S, V, N) with respect to the extensive variables 
as a metric tensor, and by Ruppeiner 0] using the cor- 
responding derivatives of the entropy, S{U,V,N). Us- 
ing intensive variable derivatives of the free energy was 
first discussed by Schlogl 0, [13, [lH. For macroscopic 
thermodynamic systems these different definitions of the 
metric are essentially equivalent [1, analogously to 
the macroscopic equivalence of ensembles. However, in 
small systems these metrics are in general different and 
the Weinhold and Ruppeiner metrics may not exist, since 
the second derivatives of the entropy and entropy are not 
guaranteed to be positive. The definition adopted in this 
paper [Eqs. (|4|)], essentially that of Schlogl, does not re- 
quire the thermodynamic limit. 

Moreover, with this definition we can make an impor- 
tant connection to statistical estimation theory, since the 



thermodynamic metric tensor Eq. ([4]) is then identical to 
the Fisher information matrix J^]. 
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According to the Cramer-Rao inequality the variance of 
any unbiased estimator is at least as high as the inverse 
of the Fisher information [l^. 

In 1945 Rao introduced the 'entropy differential met- 
ric', the distance between two distributions arising from 
the Riemannian metric over the parameter space with 

23,E|. 



This en- 



the Fisher information metric tensor 
tropy differential metric is identical to the thermody- 
namic length when, as here, the variables are conjugate 



parameters of a Gibbs ensemble Note that if we 

plug the Fisher information metric tensor [Eq. ([6|)] into 
the curve length [Eg. (|5l)] we can rewrite the entropy dif- 
ferential metric as Id, IitI 
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We should probable consider Rao's definition as more 
general and fundamental than the thermodynamic defini- 
tion, just as the statistical definition of entropy is widely 
considered more general and fundamental than the origi- 
nal thermodynamic definition. In particular, the entropy 
differential metric natural extends to the situation where 
the Hamiltonian is not a linear function of the control 
parameters, or where the system is not in thermal equi- 
librium. 

We can also define a related quantity, the thermody- 
namics divergence of the path. 



J = T 



dX' 
~dt 
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dt 
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In Riemannian geometry /2t is called the energy, or 
action, of the curve, due to similarity with the kinetic 
energy integral in classical mechanics. The length and 
divergence are related by the inequality. 



(9) 



which can be derived as a consequence of the Cauchy- 
Schwarz inequality f'^dt g'^dt > [J^fgdt] with 
g{t) = 1. The value of the divergence depends on the 
parametrization. The minimum value is attained only 
when the integrand is a constant along the path. 

Thermodynamic length and divergence control the dis- 
sipation of finite time thermodynamic transformations as 
we approach the infinity slow quasi-static limit 0, 0, [1] . 
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Consider a protocol that perturbs the conjugate vari- 
ables of the system from A i to A at in a series of discrete 
steps [1] . After each step we pause and allow the system 
to reequilibrate. After we get to the final thermodynamic 
state, we run the protocol in reverse, until we again reach 
the initial thermodynamic state. 

The total average change in entropy of a single step is 



AS, 



total 



system " 
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t+i 



i/t+1 



Thus, 



the hysteresis, the total average dissipation of the com- 
bined forward and backwards protocols, is 



JV-l 



= ^ (Aj+i [{X,)t+i - {X,)t] + Aj [{X,)t - {X,)t+i]) , 
t=i 

N-l 

= E - - {x,)t\ , (10) 

t=i 

N-l 

= ^ AA^A(X,) , 



which we can also write as 

T AA* MX,) , , , 

where t — N6t. In the continuum limit we can replace 
the sum by an integral and find that 

N-l 



lim N ^ AA^A(X,) = r f ^9^J^dt = J (12) 
7^1 "'0 dt dt 



N 



As the number of steps along a path increases we ap- 
proach a reversible, quasi-static process. In this limit, 
the hysteresis scales as the thermodynamic divergence 
and inversely as the number of steps. (Note that this ex- 
pression differs by a factor of 2 from Ref. Q because here 
we have considered the hysteresis, the combined dissipa- 
tion of the forward and reversed protocols, rather than 
the dissipation along a single direction.) Similar reason- 
ing relates the divergence and the hysteresis of a slow, 
finite time transformation [3]. 

The asymptotic hysteresis and thermodynamic diver- 
gence of a protocol will depend on the parametrization 
of the path. However, thanks to the length-divergence 
inequality J > C? [Eq. ([9])] we know that the minimum 
thermodynamic divergence of the path is the square of 
the thermodynamic length. Repeating the previous anal- 
ysis, we find that the thermodynamic length is related to 
the cumulative root mean single-step hysteresis. 



hm N V JA\^A{Xi) ^ C 
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Consequently, we can locate optimal, minimal dissipation 
paths connecting two thermodynamic states by measur- 
ing and optimizing the thermodynamic length. 



MEASURING THERMODYNAMIC LENGTH 

Thermodynamic length and divergence are clearly of 
fundamental interest and importance to non-equilibrium 
thermodynamics. Therefore, we shall consider how best 
to measure these quantities. The relation between dissi- 
pation and divergence [Eqs. (fT^ and ([T^ ] suggests one 
obvious approach. We run equilibrium simulations at 
a series of points along the path and examine the scal- 
ing of the dissipation with the number of steps. Since 
length and divergence are properties of the path taken 
through thermodynamic state space, but are indepen- 
dent of the underlying dynamics of the system, one can 
measure thermodynamic length in a computer simula- 
tion using whatever dynamics is most convenient, be it 
Metropolis Monte Carlo, Langevin dynamics or deter- 
ministically thermostated molecular dynamics. The only 
condition is that the chosen dynamics reproduce the cor- 
rect equilibrium ensemble, Eq. ([1]). 

Concretely, we must measure A(A'i), the mean change 
of the collective variables between neighboring thermody- 
namic states. Given K uncorrelated measurements from 
an equilibrated computer simulation we can estimate this 
value as 

A(A,) = Y,v{A^2)X,{x) - ^p(x|Ai)A,(a;) (14) 



p(a;|Ai) 

K 

~ E ^M,"(exp (aV'12 - (A^2 - Ai)A,- - 1) 

In the second line we rewrite the difference of the means 
as the mean difference. (We should not estimate the dif- 
ference of the mean directly since this will lead to large 
statistical errors that will become larger as the number of 
steps increases.) The final line follows from the definition 
of the Gibbs ensemble, Eq. ([1]). Here, Xi^t,k is A:th mea- 
surement of the ith collective variable, Xiix) taken from 
an equilibrium system defined by the conjugate variables 
At, and A^i2 = V'2 — is the difference in free entropy. 

To employ Eq. (|14p we need to know the free entropy 
change, A'(/'i2, which can be opti mally estimated using 
Bennett's acceptance ratio method 15,22,23|. Given iiT 
measurements from each of two neighboring states, Xi^.k 
and Ai_2,fc the log likelihood I that the free entropy has 



a particular value is [22l \1_ 



£(Aiji2) = — V In . : : ^ 

^fri 1 + exp (-A^Ai2 + (A^ - Al)X,,i,fc) 

1 ^ 1 

+ — 7 : : T (15) 

Kf^^ 1 + exp (-AV'21 + (Al - A|)A,,2.fc) 
and the Bennett optimal estimate of AV'12 maximizes this 
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likelihood. (See [24] for a clear and concise exposition of 
this result.) 

Rather than using this free entropy measurement to es- 
timate the mean change in the collective variables using 
Eq. (|14p . we will instead show that the Bennett likelihood 
is directly related to the thermodynamic divergence. If 
we insert the Gibbs ensemble [Eq. ([T])] into the log like- 
lihood, then in the large sample limit, we find that the 
likelihood scales as 

^(AV'iz) ^2if(jS(/;p2)_ln2) (16) 

where JS(p^;p^) is the Jensen-Shannon divergence, the 
mean of the relative entropy of each distribution to the 
mean distribution [23 |. 

(17) 

The minimum divergence is zero for identical distribu- 
tions and the maximum is In 2. The square root of the 
Jensen-Shannon divergence is a metric between proba- 
bility distributions [25|. However, unlike a Riemannian 
metric, the Jensen-Shannon metric space is not an in- 
trinsic length space. There may not be a mid point b be- 
tween points a and c such that d{a, b) + d{b, c) = d{a, c) 
and consequentially we cannot naturally measure path 
lengths. However, on any metric space we can define a 
new intrinsic metric by measuring the distance along con- 
tinuous paths. The Jensen-Shannon di verg ence between 
infinitesimally different distributions is [26l | 

JSip;p + dp) = lY.^^- (18) 

If we compare with Eq. we can see that in the con- 
tinuum limit 

£ = VS y" dVjS and ^ ^ J • (1^) 

The induced Jensen-Shannon metric is proportional to 
the thermodynamic (entropy differential) metric, and 
the induced Jensen-Shannon divergence is proportional 
to the thermodynamic divergence. Consequentially, the 
square root of Jensen-Shannon divergence between two 
thermodynamic states gives a lower bound on the ther- 
modynamic length of any path between those same 
states, and the Jensen-Shannon divergence is a lower 
bound to the thermodynamic divergence. 

To summarize, we can measure the thermodynamic 
length and minimum thermodynamic divergence along 
a path in thermodynamics state space by adapting Ben- 
nett's method. We perform a series of equilibrium simu- 
lations along the path and find the maximum likelihood 



free entropy change [Eq. (fT5)) ] and Jensen- Shannon di- 
vergence [via Eq. (jl6p ] between neighboring ensembles. 
The cumulative Jensen-Shannon metric along the path 
provides a lower bound to the thermodynamic length 
[Eq. (jl9p ] and a lower bound to the minimum divergence 
of the path [via Eq. ©J. This procedure is then repeated 
with finer discretizations of the path, until the estimates 
of divergence and length converge. 

This research was supported by the Department of En- 
ergy, under contract DE-AC02-05CH11231. 



* Electronic address: IGECrooks@lbl.govl 
[1] F. Weinhold, J. Chem. Phys. 63, 2479 (1975). 
[2] G. Ruppeiner, Phys. Rev. A 20, 1608 (1979). 
[3] P. Salamon and R. S. Berry, Phys. Rev. Lett. 51, 1127 
(1983). 

[4] P. Salamon, J. Nulton, and E. Ihrig, J. Chem. Phys. 80, 

436 (1984). 
[5] F. Schlogl, Z. Phys. B 59, 449 (1985). 
[6] P. Salamon, J. D. Nulton, and R. S. Berry, J. Chem. 

Phys. 82, 2433 (1985). 
[7] J. D. Nulton and P. Salamon, Phys. Rev. A 31, 2520 

(1985). 

[8] J. Nulton, P. Salamon, B. Andresen, and Q. Anmin, J. 

Chem. Phys. 83, 334 (1985). 
[9] H. Janyszek and R. Mrugala, Phys. Rev. A 39, 6515 

(1989). 

[10] R. Mrugala, J. D. Nulton, J. C. Schon, and P. Salamon, 

Phys. Rev. A 41, 3156 (1990). 
[11] D. Brody and N. Rivier, Phys. Rev. E 51, 1006 (1995). 
[12] P. Salamon and J. D. Nulton, Europhys. Lett. 42, 571 

(1998). 

[13] M. Schaller, K. H. Hoffmann, G. Siragusa, P. Salamon, 

and B. Andresen, Comp. Chem. Eng. 25, 1537 (2001). 
[14] J. D. Nulton and P. Salamon, J. Non-Equilib. Thermo- 

dyn. 27, 271 (2002). 
[15] C. H. Bennett, J. Comput. Phys. 22, 245 (1976). 
[16] H. B. Callen, Thermodynamics and an Introduction to 

Thermostatistics (Wiley, New York, 1985), 2nd ed. 
[17] W. K. Wootters, Phys. Rev. D 23, 357 (1981). 
[18] B. Andresen, R. S. Berry, R. Gilmore, E. Ihrig, and 

P. Salamon, Phys. Rev. A 37, 845 (1988). 
[19] T. M. Cover and J. A. Thomas, Elements of Information 

Theory (Wiley, New York, 1991). 
[20] C. R. Rao, Bull. Calcutta Math. Soc. 37, 81 (1945). 
[21] J. Burbea and C. R. Rao, J. Multivariate Anal. 12, 575 

(1982). 

[22] M. R. Shirts, E. Bair, G. Hooker, and V. S. Pande, Phys. 

Rev. Lett. 91, 140601 (2003). 
[23] P. Maragakis, M. Spichty, and M. Karplus, Phys. Rev. 

Lett. 96, 100602 (2006). 
[24] J. Lin, IEEE Trans. Info. Theory 37, 145 (1991). 
[25] J. Endres, D.M. Schindelin, IEEE Trans. Info. Theory 

49, 1858 (2003). 
[26] A. Majtey, P. W. Lamberti, M. T. Martin, and A. Plas- 

tino, Eur. Phys. J. D 32, 413 (2005). 



